136 research outputs found
Unsupervised Induction of Modern Standard Arabic Verb Classes
We exploit the resources in the Arabic Treebank (ATB) for the novel task of automatically creating lexical semantic verb classes for Modern Standard Arabic (MSA). Verbs are clustered into groups that share semantic elements of meaning as they exhibit similar syntactic behavior. The results of the clustering experiments are compared with a gold standard set of classes, which is approximated by using the noisy English translations provided in the ATB to create Levin-like classes for MSA. The quality of the clusters is found to be sensitive to the inclusion of information about lexical heads of the constituents in the syntactic frames, as well as parameters of the clustering algorithm. The best set of parameters yields an Fβ=1 score of 0.501, compared to a random baseline with an Fβ=1 score of 0.37
Locality and Accessibility in Wh-Questions
Even in relatively configurational languages, such as English, speakers frequently have a choice between different constituent orders. Many of these word order variations have been linked to complexity (Hawkins 2005; inter alia). For example, heavy-NP shift is more likely if the shifted NP is more complex than the NP it shifts over (Wasow 1997). Other cases of word order variations, however, have not been considered in these terms. The choice between different wh-phrase orders, as in (1), has been said to be determined by (categorical) grammatical constraints, such as Superiorit
ACI-BENCH: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation
Recent immense breakthroughs in generative models such as GPT-4 have
precipitated re-imagined ubiquitous usage of these models in all applications.
One area that can benefit by improvements in artificial intelligence (AI) is
healthcare. The note generation task from doctor-patient encounters, and its
associated electronic medical record documentation, is one of the most arduous
time-consuming tasks for physicians. It is also a natural prime potential
beneficiary of advances in generative models. However, with such advances,
benchmarking is more critical than ever. Whether studying model weaknesses or
developing new evaluation metrics, shared open datasets are an imperative part
of understanding the current state of the art. Unfortunately, as clinic
encounter conversations are not routinely recorded and are difficult to
ethically share due to patient confidentiality, there are no sufficiently large
clinic dialogue-note datasets to benchmark this task. Here we present the
Ambient Clinical Intelligence Benchmark (ACI-BENCH) corpus, the largest dataset
to date tackling the problem of AI-assisted note generation from visit
dialogue. We also present the benchmark performances of several common
state-of-the-art approaches
The source ambiguity problem: Distinguishing the effects of grammar and processing on acceptability judgments
Judgments of linguistic unacceptability may theoretically arise from either grammatical deviance or significant processing difficulty. Acceptability data are thus naturally ambiguous in theories that explicitly distinguish formal and functional constraints. Here, we consider this source ambiguity problem in the context of Superiority effects: the dispreference for ordering a wh-phrase in front of a syntactically “superior” wh-phrase in multiple wh-questions, e.g., What did who buy? More specifically, we consider the acceptability contrast between such examples and so-called D-linked examples, e.g., Which toys did which parents buy? Evidence from acceptability and self-paced reading experiments demonstrates that (i) judgments and processing times for Superiority violations vary in parallel, as determined by the kind of wh-phrases they contain, (ii) judgments increase with exposure, while processing times decrease, (iii) reading times are highly predictive of acceptability judgments for the same items, and (iv) the effects of the complexity of the wh-phrases combine in both acceptability judgments and reading times. This evidence supports the conclusion that D-linking effects are likely reducible to independently motivated cognitive mechanisms whose effects emerge in a wide range of sentence contexts. This in turn suggests that Superiority effects, in general, may owe their character to differential processing difficulty
Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics
Data from high-energy physics (HEP) experiments are collected with
significant financial and human effort and are mostly unique. An
inter-experimental study group on HEP data preservation and long-term analysis
was convened as a panel of the International Committee for Future Accelerators
(ICFA). The group was formed by large collider-based experiments and
investigated the technical and organisational aspects of HEP data preservation.
An intermediate report was released in November 2009 addressing the general
issues of data preservation in HEP. This paper includes and extends the
intermediate report. It provides an analysis of the research case for data
preservation and a detailed description of the various projects at experiment,
laboratory and international levels. In addition, the paper provides a concrete
proposal for an international organisation in charge of the data management and
policies in high-energy physics
Supporting Emirati females' leadership skills through teaching them how to debate: Design, assessment, and considerations
© 2016 Elsevier Ltd. In response to the emerging need in the United Arab Emirates to empower young women and prepare them for future leadership tasks, a debate teaching intervention was organized in two phases at a public University in Dubai. During that intervention, 137 female Emirati students were taught the basics of debate and then participated in a debate session on a topic of general interest (Dubai EXPO 2020). Results show that participants observe a clear change in how they perceive themselves as leaders as a result of the intervention. Moreover, their leadership discourse as measured in terms of the persuasiveness of their expressed arguments at a group level was seen to improve more when the debate format followed had a formal structure than when it was flexible. Implications are discussed regarding the transformative learning function of debate as a training tool and its effect on leadership self-efficacy
Critical Exponents, Hyperscaling and Universal Amplitude Ratios for Two- and Three-Dimensional Self-Avoiding Walks
We make a high-precision Monte Carlo study of two- and three-dimensional
self-avoiding walks (SAWs) of length up to 80000 steps, using the pivot
algorithm and the Karp-Luby algorithm. We study the critical exponents
and as well as several universal amplitude ratios; in
particular, we make an extremely sensitive test of the hyperscaling relation
. In two dimensions, we confirm the predicted
exponent and the hyperscaling relation; we estimate the universal
ratios , and (68% confidence
limits). In three dimensions, we estimate with a
correction-to-scaling exponent (subjective 68%
confidence limits). This value for agrees excellently with the
field-theoretic renormalization-group prediction, but there is some discrepancy
for . Earlier Monte Carlo estimates of , which were , are now seen to be biased by corrections to scaling. We estimate the
universal ratios and ; since , hyperscaling holds. The approach to
is from above, contrary to the prediction of the two-parameter
renormalization-group theory. We critically reexamine this theory, and explain
where the error lies.
Comment: 87 pages including 12 figures, 1029558 bytes Postscript
(NYU-TH-94/09/01
Search for Kaluza-Klein Graviton Emission in $p\bar{p}$ Collisions at $\sqrt{s} = 1.8$ TeV using the Missing Energy Signature
We report on a search for direct Kaluza-Klein graviton production in a data
sample of 84 of $p\bar{p}$ collisions at $\sqrt{s}$ = 1.8 TeV, recorded
by the Collider Detector at Fermilab. We investigate the final state of large
missing transverse energy and one or two high energy jets. We compare the data
with the predictions from a -dimensional Kaluza-Klein scenario in which
gravity becomes strong at the TeV scale. At 95% confidence level (C.L.) for
=2, 4, and 6 we exclude an effective Planck scale below 1.0, 0.77, and 0.71
TeV, respectively.
Comment: Submitted to PRL, 7 pages 4 figures/Revision includes 5 figure
Measurement of the average time-integrated mixing probability of b-flavored hadrons produced at the Tevatron
We have measured the number of like-sign (LS) and opposite-sign (OS) lepton
pairs arising from double semileptonic decays of and -hadrons,
pair-produced at the Fermilab Tevatron collider. The data samples were
collected with the Collider Detector at Fermilab (CDF) during the 1992-1995
collider run by triggering on the existence of and candidates
in an event. The observed ratio of LS to OS dileptons leads to a measurement of
the average time-integrated mixing probability of all produced b-flavored
hadrons which decay weakly, (stat.)
(syst.), that is significantly larger than the world average .
Comment: 47 pages, 10 figures, 15 tables. Submitted to Phys. Rev.